On the Linear-cost Subtree-transfer Distance between Phylogenetic Trees (revised Version of Dimacs Technical Report 97-18) 1
نویسندگان
چکیده
Di erent phylogenetic trees for the same group of species are often produced either by procedures that use diverse optimality criteria [16] or from di erent genes [12] in the study of molecular evolution. Comparing these trees to nd their similarities and dissimilarities (i.e. distance) is thus an important issue in computational molecular biology. Several distance metrics including the nearest neighbor interchange (nni) distance and the subtree-transfer distance have been proposed and extensively studied in the literature. This article considers a natural extension of the subtreetransfer distance, called the linear-cost subtree-transfer distance, and studies the complexity and e cient approximation algorithms for this distance as well as its relationship to the nni distance. The linear-cost subtree-transfer model seems more suitable than the (unit-cost) subtree-transfer model in some applications. The following is a list of our results. 1. The linear-cost subtree-transfer distance is in fact identical to the nni distance on unweighted phylogenies. 2. There is an algorithm to compute an optimal linear-cost subtree-transfer sequence between unweighted phylogenies in O(n 2O(d)) time, where d denotes the linear-cost subtree-transfer distance. Such an algorithm is useful when d is small. 3. Computing the linear-cost subtree-transfer distance between two weighted phylogenetic trees is NP-hard, provided we allow multiple leaves of a tree to share the same label (i.e. the trees are not necessarily uniquely labeled). 4. There is an e cient approximation algorithm for computing the linear-cost subtree-transfer distance between weighted phylogenies with performance ratio 2.
منابع مشابه
Negative Cycles inWeighted Digraphs
1. Is there a constant ratio approximation algorithm for the nni distance on unweighted evolutionary trees or is the O(log n)-approximation the best possible? 2. Is the linear-cost subtree-transfer distance NP-hard to compute on weighted evolutionary trees if leaf labels are not allowed to be non-unique? 3. Can one improve the approximation ratio for linearcost subtree-transfer distance on weig...
متن کاملSupertrees Based on the Subtree Prune-and-Regraft Distance
Supertree methods reconcile a set of phylogenetic trees into a single structure that is often interpreted as a branching history of species. A key challenge is combining conflicting evolutionary histories that are due to artifacts of phylogenetic reconstruction and phenomena such as lateral gene transfer (LGT). Many supertree approaches use optimality criteria that do not reflect underlying pro...
متن کاملChain Reduction Preserves the Unrooted Subtree Prune-and-Regraft Distance
The subtree prune-and-regraft (SPR) distance metric is a fundamental way of comparing evolutionary trees. It has wide-ranging applications, such as to study lateral genetic transfer, viral recombination, and Markov chain Monte Carlo phylogenetic inference. Although the rooted version of SPR distance can be computed relatively efficiently between rooted trees using fixed-parameter-tractable algo...
متن کاملSPR Distance Computation for Unrooted Trees
The subtree prune and regraft distance (d(SPR)) between phylogenetic trees is important both as a general means of comparing phylogenetic tree topologies as well as a measure of lateral gene transfer (LGT). Although there has been extensive study on the computation of d(SPR) and similar metrics between rooted trees, much less is known about SPR distances for unrooted trees, which often arise in...
متن کاملA 3-factor approximation algorithm for a Minimum Acyclic Agreement Forest on k rooted, binary phylogenetic trees
Molecular phylogenetics is a well-established field of research in biology wherein phylogenetic trees are analyzed to obtain insights into the evolutionary histories of organisms. Phylogenetic trees are leaf-labelled trees, where the leaves correspond to extant species (taxa), and the internal vertices represent ancestral species. The evolutionary history of a set of species can be explained by...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997